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<120> POST HARVEST CONTROL OF GENETICALLY MODIFIED CROP GROWTH EM- 
10 • PLOYING D- AMINO ACID COMPOUNDS 

<130> PF 55447 EP 



15 



<160> 16 

<170> Patentln version 3.1 



<210> 1 

20 <211> 1160 

<212> DNA 

<213> Rhodosporidium toruloides 



25 



<220> 
<221> 
<222> 
<223> 



CDS 

(1)..(1104) 
coding for DAAO 



<400> 1 
30 atg cac teg cag 
Met His Ser Gin 
3. 

ctg age age gee 
Leu Ser Ser Ala 
35 20 
etc gcg cgc gac 
Leu Ala Arg Asp 
35 

cca tgg get ggc 
40 Pro Trp Ala Gly 

50 . 

cct cga caa gca 
Pro Arg Gin Ala 
65 

45 ttg gtc ccg acg 
Leu Val Pro Thr 



aag cgc gtc gtt 
Lys" Arg Val Val 
5 

etc ate etc get 
Leu lie Leu Ala 



ttg ccg 
Leu Pro 

gcg aat 
Ala Asn 

aaa tgg 
Lys Trp 

70 
ggc cat 
Gly His 
85 



gag gac 
Glu Asp 

40 
tgg acg 
Trp Thr 
55 

gaa gaa 
Glu Glu 

gec atg 
Ala Met 



gtc etc gga tea 
Val Leu Gly Ser 
10 

egg aag ggc tac 
Arg Lys Gly Tyr 
25 

gtc teg age cag 
Val Ser Ser Gin 

cct ttc atg acg 
Pro Phe Met Thr 
60 

teg act ttc aag 
Ser Thr Phe Lys 
75 

tgg etc aag ggg 
Trp Leu Lys Gly 
90 



ggc 
Gly 

age 
Ser 

act 

Thr 

45 

ctt 

Leu 



gtt 
Val 

gtg 

Val 

30 

ttc 

Phe 



ate 

lie 

15 

cat 

His 

get 
Ala 



aca gac 
Thr Asp 



ggt 
Gly 

att 
lie 

tea 
Ser 

ggt 
Gly 



aag tgg gtc gag 
Lys Trp Val Glu 
80 

acg agg egg ttc 
Thr Arg Arg Phe 
95 



48 



96 



14 4- 



192 



240 



288 
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gcg cag aac gaa gac ggc ttg etc ggg cac tgg tac aag gac ate acg 336 
Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp He Thr 

100 105 110 

cca aat tac cgc ccc etc cca tct tec gaa tgt cca cct ggc get ate 384 
5 Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala He 
115 120 125 

ggc gta acc tac gac acc etc tec gtc cac gca cca aag tac tgc cag 432 
Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

10 tac ctt gca aga gag ctg cag aag etc ggc gcg acg ttt gag aga egg 480 
Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
145 150 155 160 

acc gtt acg teg ctt gag cag gcg ttc gac ggt gcg gat ttg gtg' gtc 528 
Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 
15 165 170 175 

aac get acg gga ctt ggc gee aag teg att gcg ggc ate gac gac caa 57 6 

Asn Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 

180 185 190 

gec gec gag cca ate cgc ggg caa acc gtc etc gtc aag tec cca tgc 624 
20 Ala Ala- Glu Pro He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
. 195. 200 205 

aag cga tgc acg atg gac teg tec gac .ccc. get tct ccc gee tac ate 672 
Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr He 
210 215 220 

25 att ccc cga cca ggt ggc gaa gtc ate tgc ggc ggg acg tac ggc gtg • 720 
He Pro Arg Pro Gly Gly Glu Val He Cys Gly Gly Thr Tyr Gly Val 
225 230 * 235 240 

gga gac tgg gac ttg tct gtc aac cca gag acg gtc cag egg ate etc 768 
Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg He Leu 
30 . 245 250 • - -255 

aag cac tgc ttg cgc etc gac ccg acc ate teg age -gac gga acg ate 816 
Lys His Cys Leu Arg Leu Asp Pro Thr He Ser Ser Asp Gly Thr lie 

260 265 270 

gaa ggc ate gag gtc etc cgc cac aac gtc ggc ttg cga cct gca cga . 8 64 

35 Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
275 280 285 

cga ggc gga ccc cgc gtt gag gca gaa egg ate gtc ctg cct etc gac 912 
Arg Gly Gly Pro Arg Val Glu Ala Glu Arg He Val Leu Pro Leu Asp 
290 295 300 

40 egg aca aag teg ccc etc teg etc ggc agg ggc age gca cga gcg gcg 960 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

aag gag aag gag gtc acg ctt gtg cat gcg tat ggc ttc teg agt gcg 1008 
Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 
45 325 330 335 

gga tac cag cag agt tgg ggc gcg gcg gag gat gtc gcg cag etc gtc 1056 
Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 
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340 345 350 

gac gag gcg ttc cag egg tac cac ggc gcg gcg egg gag teg aag ttg 1104 
Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 

355 360 365 

tagggeggga tttgtggctg tattgeggge atctacaaga aaaaaaaaaa aaaaaa 1160 

<210> 2 

<211> 368 

<212> PRT 

<213>* Rhodosporidium toruloides 



<400> 2 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 

Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 
20 25 30 

Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
35 40 45 

Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55' 60 

Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu. 
65 70 75 80 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 
85 90 95 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp He Thr 
100 105 110 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala He 
115 120 125 



Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 

145 150 155 160 

Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 

165 170 175 



Asn Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 
180 185 190 



f • • 
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Ala Ala Glu Pro lie Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 205 

5 Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr lie 
210 215 220 

lie Pro Arg Pro Gly Gly Glu Val lie Cys Gly Gly Thr Tyr Gly Val 
225 230 235 240 

10 

Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg lie Leu 
245 250 255 

Lys His Cys Leu Arg Leu Asp Pro Thr lie Ser Ser Asp Gly Thr lie 
15 260 265 270 

Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro- Ala Arg 
275 280 285 

20 Arg Gly Gly Pro Arg Val Glu Ala Glu Arg He Val Leu Pro Leii Asp 
290 295 300 

Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

25 

Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 
325 330 335 

Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 
30 340 345 ' 350 



Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
355 360 365 

35 <210> 3 

<211> 1005 
<212> DNA 

<213> Caenorhabditis elegans 

40 <220> 

<221> CDS 

<222> (1)..(1002) 

<223> coding for DAAO 

45 <400> 3 

atg gca aac ata att ccg aag att gca att ate ggc gaa gga gtc att 4 8 

Met Ala Asn He He Pro Lys He Ala He He Gly Glu Gly Val He 



96 
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15 10 15 

gga tgt act tea gca ctt caa ata tea aaa get ata cca aat gcg aaa 
Gly Cys Thr Ser- Ala Leu Gin lie Ser Lys Ala He Pro Asn Ala Lys 
20 25 30 

5 ata act gtg etc cac gat aaa cca ttt aaa aaa teg tgc agt gca gga 144 
He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 

35 40 45 

cca gca gga tta. ttt aga ate gat tat gag gag aat act gaa tac gga 192 
Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 

10 50 55 60 

cgt get tct ttc gec tgg ttc tea cat etc tat cgc act aca aaa gga 240 

Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 

65 70 * 75 80 

tec gaa ace ggc gtg aaa tta gtt tct gga cat att caa tec gac aac 288 

15 Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 

85 90 95 

ttg gag tea ttg aag caa caa caa aga gee tat ggc gat att gtg tac 336 
Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp He Val Tyr 
10O ' 105 110 

20 aac ttt aga ttc ttg gat gat aga gaa egg ctg gac att ttt ccc gaa 384 
Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 

115 120 125 

cca tea aag cac tgc att cac tac ace gee tac gca tea gaa ggt aac 432 
Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser. Glu Gly Asn 

25 130 135 140 

aag tac gtg cct tat ttg aag aat ttg ctg ctt gag caa aaa ate gag 
Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He Glu 
145 150 155 160 

ttc aag caa caa gaa gtg acg agt ttg gac gca gtc gee gac get ggt 

30 Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

r 165 170 175 

tac gat gtt att gta aac tgc gca ggc ttg tac ggt gga aag ttg get 57 6 

Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 
180 185 190 

35 ggt gat gac gat act tgc tac ccc att aga gga gtc att ttg gaa gtt 624 
Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu Glu Val 

195 200 205 

gat gca cca tgg cac aag cac ttc aat tat cga gac ttt act act ttc 672 
Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 

40 210 215 220 

aca att cca aaa gag cac age gtg gtg gtt ggg tec ace aag cag gac 720 

Thr He Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 

225 230 235 240 

aat cga tgg gat ttg gag ate acc gac gag gat' aga aat gat att ttg 7 68 

45 Asn Arg Trp Asp Leu Glu He Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 

aaa cga tac att get tta cat cct gga atg aga gag cca aag att ate 816 



480 



528 
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Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie lie 

260 265 270 

aaa gaa tgg tea gca ctt cgc ccg gga cgt aag cat gtc aga att gaa 864 
Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
5 275 280 285 

gcg cag aag agg aca tct gtt gga aac tea aaa gat tat atg gtt gtg 912 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 

290 295 300 

cat cac tat ggt cac ggg age aac gga ttc acg ttg ggt tgg gga aca 960 
10 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

gca att gaa gca act aaa ctt gtt aag act gca eta gga tta taa 1005 
Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 
325 330 

15 

<210> 4 
<211> 334 
<212> PRT 

<213> Caenorhabditis elegans 

20 

<400> 4 

Met Ala Asn lie lie Pro Lys lie Ala lie lie Gly Glu Gly Val He 
1 5 10 15 

25 Gly Cys Thr Ser Ala Leu Gin He Ser Lys Ala He Pro Asn Ala Lys 
20 25 30 

He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35- 40 45 



30 



Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 



Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
35 65 70 75 80 

Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 
85 90 95 

40 Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp He Val Tyr 
100 105 110 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 
115 120 ' 125 

45 

Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 
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Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys lie Glu 
145 150 155 160 

5 Phe Lys Gin Gin Glu Val Tfci.r Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 ' 

Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 
180 185 190 

10 

Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu Glu Val 
195 200 205 

Asp Ala Pro Trp His Lys Hi_s Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
15 210 215 220 

Thr lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

20 Asn Arg Trp Asp Leu Glu He Thr Asp Glu Asp Arg Asn Asp lie Leu 

245 250 255 

Lys Arg Tyr lie Ala Leu H±s Pro Gly Met Arg Glu Pro Lys lie He 
260 265 270 

25 

Lys Glu Trp Ser Ala Leu Ar-g Pro Gly Arg Lys His Val Arg lie Glu 
275 280 285 

Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
30 290 295 300 

His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

35 Ala He Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 

325 330 

<210> 5 
<211> 1186 . 
40 <212> DNA 

<213> Nectria haematococca 

<220> 

<221> ' CDS 
45 <222> (42).. (1124) 

<223> coding for DAAO 
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<400> 5 

agcgacttga atttagcgaa aagaacttgt caaccacaat c atg tec aac aca ate 5 6 

5 Met Ser Asn Thr lie 

1 .5 

gtc gtc gtt ggt gec ggt gtc att ggc ttg acg teg gee ttg ttg etc 104 
Val Val Val Gly Ala Gly Val He Gly Leu Thr Ser Ala Leu Leu Leu 
10 15 20 

10 tec aag aac aag ggc aac aag ate acc gtc gtg gec aag cac atg ccc 152 
Ser Lys Asn Lys Gly Asn Lys He Thr Val Val Ala Lys His Met Pro 
25 30 35 

. ggc gac tat gac gtt gaa tac gec teg cct ttt get ggt gec aac cac 200 
Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 
15 40 45 50 

tec ccc atg gcg acg gaa gag age age gaa tgg gaa cgt cgc act tgg . 248 
Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 

55 60 65 

tac gag ttt aag aga ctg gtc gag gag gtc cct gag gee ggt gtt cat 296 
20 Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 ■ 85 

ttc cag aag tct cgc ate cag agg cgc aat gtg gac act gaa aag gcg 34 4 

Phe Gin Lys Ser Arg He Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 
90 95 100 

25 cag agg tct ggt ttc cca gac gec etc ttc teg aaa gaa ccc tgg ttc 392 
Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 115 

aag aac atg ttt gag gac ttc cgt gag cag cac cct age gag gtc ate 440 
Lys Asn Met Phe - Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val He 
30 120 125 130 

ccc ggt tac gac tct ggc tgc gag ttc aca teg gtg tgc ate aac acg 488 
Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser Val Cys lie Asn Thr 

135 140 145 

gee ate tac etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc . 536 
35 Ala He Tyr Leu Pro Trp Leu Leu Gly Gin Cys lie Lys Asn Gly Val 
150 155 i60 165 

ate gtc aag cgc gee ate etc aac gac att age gag gee aag aag ctg 584 
lie Val Lys Arg Ala lie Leu Asn Asp He Ser Glu Ala Lys Lys Leu 
170 175 180 

40 age cac gcg ggc aag acg ccc aat ate ate gtc aac gee acg ggt etc 632 
Ser His Ala Gly Lys Thr Pro Asn lie He Val Asn Ala Thr Gly Leu 

185 190 195 

ggc tec tac aag ctg ggc ggt gtc gag gac aag acc atg gcg cct gcg 680 
Gly Ser Tyr Lys Leu Gly Gly Val Glu -Asp Lys Thr Met Ala Pro Ala 
45 200 205 • 210 

egg gga cag att gtg gtt gtg cgc aac gag age age ccc atg etc etc 728 
Arg Gly Gin He Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu 
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215 220 225 

act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 77 6 

Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 ■ 240 245 

5 cga gca get ggc ggt ggc ace ate ctg ggc ggt ace tac gac gtt ggc 824 
Arg Ala Ala Gly Gly Gly Thr lie Leu Gly Gly Thr Tyr Asp Val Gly 

250 255 260 

aac tgg gag tct cag cca gac ccc aac ate gcg aat cgc ate atg cag 872 
Asn Trp Glu Ser Gin Pro Asp Pro Asn lie Ala Asn Arg lie Met Gin 
10 265 270 275 

cgc ate gtc gag gtg egg ccc gag att gec aac ggc aag ggc gtc aag 920 
Arg lie Val Glu Val Arg Pro Glu lie Ala Asn Gly Lys Gly Val Lys 

280 285 290 

ggg ctg age gtg ate cga cac gee gtc ggc atg egg ccg tgg cga aag 968 
15 Gly Leu Ser Val lie Arg His Ala Val Gly Met Arg Pro Trp Arg Lys 
295 300 305 

gac gga gtc agg ate gag gag gag aag ctg gat gat gag act tgg ate 1016 
Asp Gly Val Arg lie Glu Glu Glu Lys Leu. Asp Asp Glu Thr Trp lie 
310 315 • 320 325 

20 gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 1064 
Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 

tgt get gag aat gta gtc cag ttg gtt gac aag gtc ggc aag gcg gee 1112 
Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 
25 345 350 355 

aag tct aag ctg tagttgaaaa ggcctgaatg agtaatagta attggatatt 1164 
Lys Ser Lys Leu 
360 

ggaaataccg tatttgeect eg 118 6 

30 . 

<210> 6 
<211> 361 
<212> PRT 

<213> Nectria haematococca 

35 . 

<400> 6 

Met Ser Asn Thr He Val Val Val Gly Ala Gly Val He Gly Leu Thr 
1 * 5 10 15 

40 Ser Ala Leu Leu Leu Ser Lys Asn Lys Gly Asn Lys He Thr Val Val 
20 25 30 

Ala Lys His Met Pro Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe 
35 ' 40 ' 45 



45 



Ala Gly Ala Asn His Ser Pro Met Ala Thr* Glu Glu Ser Ser Glu Trp 
50 55 " 60 



WO 2005/090582 PCT/EP2005/002735 

10/34 

Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Vai Glu Glu Val Pro 
65 70 75 80 

5 Glu Ala Gly Val His Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val 

85 90 95 

Asp Thr Glu Lys Ala Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser 
100 105 110 

10 

Lys Glu Pro Trp Phe Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His 
115 120 125 

Pro Ser Glu Val lie Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser 
15 130 135 14 0 

Val Cys He Asn Thr Ala He Tyr Leu Pro Trp Leu Leu Gly Gin Cys 
145 150 155 160 

20 He Lys Asn Gly Val lie Val Lys Arg Ala He Le*u Asn Asp He Ser 

165 170 175 

Glu Ala Lys Lys Leu Ser His. Ala Gly Lys Thr Pro Asn He He Val 
180 ' 185 190 

25 

Asn Ala Thr Gly Leu Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys 
195 200 205 

Thr Met Ala Pro Ala * Arg Gly Gin He Val Val Val Arg Asn Glu Ser 
30 210 215 22 0 

Ser Pro Met Leu Leu Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val 
225 230 235 24 0 

35 Met Tyr Leu Met Gin Arg Ala Ala Gly Gly Gly Th.x He Leu Gly Gly 

245 250 255 

Thr Tyr Asp Val Gly Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala 
260 ' 265 270 

40 

Asn Arg He Met Gin Arg He Val Glu Val Arg Pr-o Glu He Ala Asn 
275 280 285 

Gly Lys Gly Val Lys Gly Leu Ser Val He Arg His Ala Val Gly Met 
45 290 295 300 



Arg Pro Trp Arg Lys Asp Gly Val Arg He Glu Glu Glu Lys Leu Asp 



WO 2005/090582 PCT/EP2005/002735 

11/34 

305 310 315 320 

Asp Glu Thr Trp lie Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr 
325 330 335 

5 

Gin Gly Ser Tyr Gly Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys 
340 345 350 

Val Gly Lys Ala Ala Lys Ser Lys Leu 
10 355 360 

<210> 7 

<211> 1071 

<212> DNA 

15 <213> Trigonopsis variabilis 

<220> 

<221> CDS 
<222> (1)..(1068) 
20 <223> 

<400> 7 

atg get aaa ate gtt gtt att ggt gec ggt gtt gec <ggt tta act aca 48 

Met Ala Lys lie Val Val He Gly Ala Gly Val Ala Gly Leu Thr Thr 

25 ' 1 5 .10 15 

get ctt caa ctt ctt cgt aaa gga cat gag gtt aca att gtg tec gag 96 

Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr He Val Ser Glu 

20 25 30 

ttt acg ccc ggt gat ctt agt ate gga tat acc teg cct tgg gca ggt 14 4 

30 * Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Trp Ala Gly 

35 40 45 

gec aac tgg etc aca ttt tac gat gga ggc aag tta gee gac tac gat ' 192 

Ala Asn Trp Leu Thr Phe . Tyr Asp Gly Gly .Lys Leu Ala Asp Tyr Asp 
50 55 60 

35 gec gtc tct tat cct ate ttg cga gag ctg get cga age age ccc gag 240 

Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 

65 70 75 80 

get gga att cga etc ate age caa cgc tec cat gtt etc aag cgt gat 288 

Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 

40 85 90 95 

ctt cct aaa ctg gaa gtt gee atg teg gec ate tgt caa cgc aat ccc 336 

Leu Pro Lys Leu Glu Val Ala Met Ser Ala He Cys Gin Arg Asn Pro 

100 105 110 

tgg ttc aaa aac aca gtc gat tct ttc'gag att ate gag gac agg tec 384 

45 Trp Phe Lys Asn Thr Val Asp Ser Phe Glu lie He Glu Asp Arg Ser 

115 120 125 

agg att gtc cac gat gat gtg get tat eta gtc gaa ttt cgt tec gtt 432 
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Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 

130 135 140 

tgt ate cac acc gga gtc tac ttg aac tgg ctg atg tec caa tgc -tta 4 80 

Cys lie His Thx Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys leu 
5 145 150 155 160 

teg etc ggc gee acg gtg gtt aaa cgt cga gtg aac cat ate aag gat 528 
Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys -Asp 

165 170 175 

gee aat tta eta cac tec tea gga tea cgc ccc gac gtg att gtc aac 57 6 

10 Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val lie Val .Asn 
180 185 190 

tgt agt ggt etc ttt gee egg ttc ttg gga ggc gtc gag gac aag aag 624 
Cys Ser' Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Xys 
195 ' 200 205 

15 atg tac cct att cga gga caa gtc gtc ctt gtt cga aac tct ctt cct 672 
Met Tyr Pro lie Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 

210 215 220 

ttt atg gec tec ttt tec age act cct gaa aaa gaa aat gaa gac gaa 720 
Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
20 225 230 235 * 240 

get eta tat ate atg acc cga ttc gat ggt act tct ate att ggc ggt 7 68 

Ala Leu Tyr lie Met Thr Arg Phe Asp Gly Thr Ser lie lie Gly Gly 

245 . 250 255 

tgt ttc caa ccc. aac aac tgg tea tec gaa ccc gat cct tct etc acc 816 
25 Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 
260 265 270 

cat cga ate ctg tct aga gee etc gac cga ttc ccg gaa ctg acc aaa 8 64 

His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
275 280 . ■ 285 

30 gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc- cac cgt cct ggt 912 
Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 

290 295 .... 300 

aga gag ggc ggt ccc- cga gta gaa tta gag aag ate ccc ggc gtt ggc 960 
Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys He Pro Gly Val Gly . 
35 305 310 315- 320 

ttt gtt gtc cat aac tat ggt gec gee ggt get" ggt tac caa tec tct 1008 
Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 . 330 335 

tac ggc atg get gat gaa get gtt tct tac .gtc gaa aga get ctt act 1056 
40 Tyr Gly Met Ala Asp Glu Ala Val Ser. Tyr Val Glu Arg Ala Leu Thr 
340 345 350 

cgt cca aac ctt tag 1071 
Arg Pro Asn Leu 
355 



45 



<210> 8 
<211> 356 



WO 2005/090582 PCT/EP2005/002735 

13/34 

<212> PRT 

<213> Trigonopsis variabilis 
<400> 8 

5 Met Ala Lys He Val Val He Gly Ala Gly Val Ala Gly Leu Thr Thr 
15 10 15 

Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr He Val Ser Glu 
20 25 30 

10 

Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Trp Ala Gly 
35 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
15 50 55 60 

Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

20 Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 

85 90 95 

Leu Pro Lys Leu Glu Val Ala Met ■ Ser Ala He Cys Gin Arg Asn Pro 
100 105 110 

25 

Trp Phe Lys Asn Thr. Val Asp Ser Phe Glu lie He Glu Asp Arg Ser 
115 120 125 

Arg He Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
30 130 135 140 

Cys He His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 160 

35 Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His He Lys Asp 

165 170 175 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val He Val Asn 
180 185 190 

40 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
195 200 205 

'Met Tyr Pro He Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
45 210 215 220 



Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
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225 230 235 240 

Ala Leu Tyr He Met Thr Arg Phe Asp Gly Thr Ser He He Gly Gly 
245 250 255 

5 

Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 
260 265 270 

His Arg He Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
10 275 280 285 ■ 

Asp Gly Pro Leu Asp He Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

15 Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys He Pro Gly Val Gly 
305 310 315 320 

Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 
325 • 330 335 



20 



Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
'340 345 - 350 



Arg Pro Asn Leu 
25 355 

<210> 9 

<211> 1047 

<212> DMA 

30 <213> Schizosaccharomyces pombe. 

<220> 

<221> CDS 

<222> (22).. (1041) 

35 <223> coding for DAAO 

<400> 9 

atgactaagg aaaataagcc a aga gat att gtc ate gtt ggc get ggc gtt 51 

Arg Asp He Val He Val Gly Ala Gly Val 

40 1 5 10 

att gga ttg acc act get tgg att ctt tea gac ttg ggt ctt get cct 99 

He Gly Leu Thr Thr Ala Trp He Leu Ser Asp Leu Gly Leu Ala Pro 

15 20 25 

cgt att aag gtg att gec aag tat acg cct gaa gat cgt tct gta gaa 147 

45 Arg He Lys Val He Ala Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu 

30 35 40 

tac act tec cct tgg get ggc gca aat ttc tgt age att tct get act 195 



• « • 
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Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser lie Ser Ala Thr 

45 50 55 

gat gac aat get ttg cgc tgg gat aaa ate act tac cat cgt ttc gec 243 
Asp Asp Asn Ala Leu Arg Trp Asp Lys lie Thr Tyr His Arg Phe Ala 
5 60 65 70 

tac ttg gcg aaa act cgt cct gaa gca gga ate cgt ttt get gat ctt 291 
Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly lie Arg Phe Ala Asp Leu 
75 80 85 90 

cga gaa ttg tgg gag tac gag ccg aaa cac gac aaa ate aga tec tgg 339 
10 Arg Glu Leu Trp Glu Tyr Glu Pro Lys His Asp Lys lie Arg Ser Trp ' 

95 100 105 

aat acc tat gtc aga gat ttc aaa gtt ate cct- gaa aaa gat ctt cca 387 
Asn Thr Tyr Val Arg Asp Phe Lys Val lie Pro Glu Lys Asp Leu Pro 
110 115 120 

15 gga gaa tgt ate tac gga cat aag gec acc acc ttt tta ate aac get 435 
Gly Glu Cys lie Tyr Gly His Lys Ala Thr Thr Phe Leu lie Asn Ala 

125 130 135 

cct cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 4-83 
Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu lie Glu Ala Gly Val 
20 140 . 145 150 

gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 531 
Glu Phe Glu' Lys Lys Glu Leu Ser His lie Lys Glu Thr Val Glu Glu 
155 160 165 170 

act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 579 
25 Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 185 

aaa ttg ggt ggc gtt gaa gac ccg gac gtt tat ccg act cgt gga cat 627 
Lys Leu Gly Gly Val Glu Asp Pro Asp Val Tyr Pro Thr Arg Gly His 
190" 195 • 200 

30 gtt gtt ttg gtt aag get cct- cat-" gta • aca gaa act cgc att ttg aat 675 
Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr Arg lie Leu Asn 

205 210 215 

ggc aag aac tct gat acc tat att att cct cgt ccc tta aat ggt gga 723 
Gly Lys Asn Ser Asp Thr Tyr lie lie Pro Arg Pro Leu Asn Gly Gly 
35 220 225 230 

gtc att tgc ggc ggt ttc atg caa cca gga aac tgg gat cgt gaa att 771 
Val lie Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu lie 
235 240 245 250 

cac cct gaa gac act ttg gat ate ctt aag aga aca teg get ttg atg 819 
40 His Pro Glu Asp Thr Leu Asp lie Leu Lys Arg Thr Ser Ala Leu Met 

255 260 265 

cca gaa ttg ttc cac ggc aag ggt ccg gag ggt get gaa att att caa 8 67 

Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu lie lie Gin 
270 275 280 

45 gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gec cgc gta gag 915 
Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Glu 
285 290 295 



10 
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ctt gat gtt gtt ccc ggc acc tea gtc ccc ctt gtt cat gat tac ggt 963 
Leu Asp Val Val Pro Gly Thr Ser Val Pro Leu Val His Asp Tyr Gly . 

300 305 310 

get tct ggc aca gga tac caa get ggt tat ggt atg get ctt gac tct 1011 
Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
315 320 325 330 

gtc atg ttg get ctt cct aaa ate aaa ttg gcttag 1047 
Val Met Leu Ala Leu Pro Lys lie Lys Leu 
335 340 

<210> 10 
<211> 340 
<212> PRT 

<213>- Schizosaccharomyces pombe 
<400> 10 

Arg Asp lie Val lie Val Gly Ala Gly Val lie Gly Leu Thr Thr Ala 
1 5 10 15 

20 Trp lie Leu Ser Asp Leu Gly Leu Ala Pro Arg lie Lys Val lie Ala 
20 25 30 

Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser Pro Trp Ala 
35 40 45 



15 



25 



Gly Ala Asn Phe Cys Ser lie Ser Ala Thr Asp Asp Asn Ala Leu Arg 
50 55 60 



Trp Asp Lys lie Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
30 65 70 75 80 

Pro Glu Ala Gly lie Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 
85 90 95 

35 Glu Pro Lys His Asp Lys lie Arg Ser Trp Asn Thr Tyr Val Arg Asp 
100 105 110 

Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys lie Tyr Gly 
115 . 120 125 

40 

His Lys Ala Thr Thr Phe Leu lie Asn Ala Pro His Tyr Leu Asn Tyr 
130 135 140 

Met Tyr Lys Leu Leu lie Glu Ala Gly Val Glu Phe Glu Lys Lys Glu 
45 145 150 155 160 



Leu Ser His lie Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 
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165 



17/34 
170 



PCT/EP2005/002735 
IL75 



Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly \/al Glu 
180 185 190 

5 

Asp Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Xiys Ala 
195 200 205 

Pro His Val Thr Glu Thr Arg lie Leu Asn Gly Lys Asn Ser Asp Thr 
10 210 215 220 

Tyr lie lie Pro Arg Pro Leu Asn Gly Gly Val lie Cys Gly Gly Phe 
225 230 235 240 

15 Met Gin Pro Gly Asn Trp Asp Arg Glu lie His Pro Glu Asp Thr Leu 

245 250 255 

Asp lie Leu Lys Arg Thr Ser Ala Leu Met Pro Glu Leu Phe Sis Gly 
260 265 270 



20 



Lys Gly Pro Glu Gly Ala Glu lie lie Gin Glu Cys Val Gly !Phe Arg 
275 280 " 285 



Pro Ser Arg Lys Gly Gly Ala Arg Val Glu Leu Asp Val Val Pro Gly 
25 290 295 300 

Thr Ser Val Pro Leu Val His Asp Tyr Gly Ala Ser Gly Thr Gly Tyr 
305 310 315 320 

30 Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser Val Met Leu Ala Xeu Pro 

325 330 335 

Lys lie Lys Leu 
340 



35 



<210> 11 

<211> 963 

<212> DMA 

<213> Streptoiayces coelicolor 



40 



<220> 

<221> CDS 

<222> (31).. (957) 

<223> 'coding for DAAO 

45 

<220> 

<221> misc feature 
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<222> (880) . . (936) 
<223> DAAO signature 

<400> 11 

5 gtggaaaccg aactggatga cgagcgggat ggc gaa gtc gtc gtg gtc ggc ggc 54 

Gly Glu Val Val Val Val Gly Gly 
1 5 

ggg gtg ate ggg ctg acg acg gec gtc gtc etc gec gag egg ggc aga 102 

Gly Val lie Gly Leu Thx Thr Ala Val Val Leu Ala Glu Arg Gly Arg 

10 10 15 20 

egg gtg egg ctg tgg ace egg gag ccc gcg gag egg ace ace teg gtg 150 

Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Arg Thr Thr Ser Val 

25 30 -35 40 

gta gcg ggc ggg ctg tgg tgg ccg tac egg ate gag ccg gtc gcg ctg 198 

15 Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie Glu Pro Val Ala Leu 

45 50 55 

gec cag gee tgg gcg ctg cgt tec ctg gac gtg tac gag gag ctg gcg 24 6 

Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 
60 65 70 * 

20 gca egg ccc ggg cag acc ggc gta cgc atg etc gaa ggg gtg etc ggc 294 

Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 

75 80 85 

gag acc ggc ctg gac gag gtg gac .ggg tgg gee gcg gec egg ctg ccg 342 

Glu Thr Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 

25 90 95 100 

ggg ctg cgc gcg gcg age gec gec gag tac gee ggg acg ggg ctg tgg 390 

Gly Leu Arg Ala Ala Ser Ala Ala Glu Tyr Ala Gly Thr Gly Leu Trp 

105 110 115 120 

gcg egg ctg ccg etc ate gac atg teg acc cat ctg ccg tgg ctg egg 438 

30 Ala Arg Leu Pro Leu lie Asp Met Ser Thr His Leu Pro Trp Leu Arg 

125 130 135 

gag egg ctg ctg gee gcg ggc ggc acg gtg gag gac cgc gcg gtg acc 486 

Glu Arg Leu Leu Ala Ala Gly Gly Thr Val Glu Asp Arg Ala Val Thr 
140 145 150 

35 gat ctg gee gag gcg gac gcg ccg gtg gtg gtc aac tgc acc ggc ctg 534 

Asp Leu Ala Glu Ala Asp Ala Pro Val Val Val Asn Cys Thr Gly Leu 

155 160 165 

ggc gee egg gag ctg gtg ccg gac ccg gcg gta egg ccg gtg cgc gga 582 

Gly Ala Arg Glu Leu Val Pro Asp Pro Ala Val Arg Pro Val Arg Gly 

40 170 175 180 

cag ctg gtc gtc gtg gag aac ccc ggc ate cac aac tgg ctg gtc gcg 630 

Gin Leu Val Val Val Glu Asn Pro Gly lie His Asn Trp Leu Val Ala 

185 190 195 200 

gec gac gcg .gac tec ggg gag acg acg tac ttc ctt ccg cag ccg gga 678 

45 Ala Asp Ala Asp Ser Gly Glu Thr Thr Tyr Phe Leu Pro Gin Pro Gly 

205 210 215 

egg etc ctg ctg ggc ggc acg get gag gag gac gee tgg teg acc gag 72 6 



* • 
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10 



15 



Arg Leu 



ccg 
Pro 

cgt 
Arg 

egg 
Arg 
265 

ggg 

Gly 



gac 
Asp 

ccc 
Pro 
250 
ccg 
Pro 



Leu Leu 
220 
ccg gag 
Pro Glu 
235 

gag ate 
Glu He 

gec egg 
Ala Arg 



cgc egg ctg 
Arg Arg Leu 



gtg gee tgg ggc 
Val Ala Trp Gly 
300 



Gly Gly Thr 

gtc gcg gcg 
Val Ala Ala 

gee gga gcg 
Ala Gly Ala 
255 

gac gcg gtc 
Asp Ala Val 

270 
gtg cac aac 
Val His Asn 
285 

tgc get cag 
Cys Ala Gin 



19/34 

Ala Glu Glu Asp 
225 

gec ate gtg cga 
Ala He Val Arg 
240 

egg gtg etc gcg 
Arg Val Leu Ala 



egg etg 
Arg Leu 

tac ggt 
Tyr Gly 

gag gcg 
Glu Ala 
305 



gag cgc 
Glu Arg 
275 
cac ggc 
His Gly 
290 

gee egg 
Ala Arg 
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Ala Trp Ser Thr Glu 
230 

egg tgc gcg gec ctg 774 
Arg Cys Ala Ala Leu 
245 

cac ctg gtg ggg ctg 822 

His Leu Val Gly Leu 

260 

ggg acg ctg ccg gac 870 
Gly Thr Leu Pro Asp 
280 

ggc gcg ggc gtc acc 918 
Gly Ala Gly Val Thr 
295 

etc gee tcctga 963 
Leu Ala 



<210> 12 

20 <211> 309 

<212> PRT 

<213> Streptomyces coelicolor 
<220> 

25 <221> misc_feature 

<222> (880) . . (936) 

<223> DAAO signature 

<400> 12 

30 Gly Glu Val Val Val Val Gly Gly Gly Val He Gly Leu Thr- Thr Ala 
15 10 15 



35 



Val Val Leu Ala Glu Arg Gly Arg Arg Val Arg Leu Trp Thr Arg Glu 
20 25 30 

Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly Gly Leu Trp Trp Pro 
35 40 45 



Tyr Arg He Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
40 50 55 60 

Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 

65 70 75 80 



45 Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 

85 90 95 
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Gly Trp Ala Ala Ala Arg Leu Pro Gly Leu Arg Ala Ala Ser Ala Ala 

100 105 110 

Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu lie Asp Met 
5 115 120 125 

Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leu Ala Ala Gly Gly 
130 135 140 

10 Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala Pro 
145 150 155 160 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
165 170 175 

15 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pro 
180 185 190 

Gly lie ' His Asn Trp Leu Val Ala Ala Asp Ala Asp Ser Gly Glu Thr 
20 195 200 205 

Thr Tyr Phe Leu Pro Gin. Pro Gly Arg Leu Leu Leu Gly Gly Thr Ala 
210 215 . 220 

25 Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu Val Ala Ala Ala 
225 230 235 240 

lie Val Arg Arg Cys Ala Ala Leu Arg Pro Glu lie Ala Gly Ala Arg 
245 250 255 

30 

Val Leu Ala His Leu Val Gly Leu Arg Pro Ala Arg Asp Ala Val Arg 
260 265 270 

Leu Glu Arg Gly Thr Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 
35 275 280 285 

Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 295 300 

40 Ala Ala Arg Leu Ala 
305 

<210> 13 
<211> 1038 
45 <212> DNA 

<213> Candida boidinii 
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<220> 

<221> CDS 

<222> (1)..(1035) 

<223> coding for DAAO 

5 

<400> 13 

atg ggt gat caa att gtt gtt ctt ggt tec ggt att att ggt tta tat 48 
Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie He Gly Leu Tyr 
15 10 15 

10 act aca tac tgt tta ate tat gag get gga tgt get cca get aaa att 96 
Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 

20 25 30 

act att gtt get gaa ttt tta cca ggt gat caa tct aca tta tat aca 14 4 

Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 

15 35 40 45 

tct cca tgg gca ggt ggt aat ttt tct tgt att tea cca get gat gat 192 
Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 

50 55 60 

aca aca ttg get tat gat aaa ttc aca tat ctt aat tta ttc aag att 240 

20 Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He 
65 70 75 80 

cac aaa aaa tta ggt gga cca gaa tgt gga tta gat. aat aag cca agt 288 
His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 
85 90 95 

25 act gaa tat tgg gat ttt tat cct.ggt gat gaa aaa gtc aat tct tta 336 
Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 ■ 105 110 

aaa caa tat ctt aaa gat ttt aaa gtt att cca aaa tea gaa tta cca 384 
Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 

30 115 120 . 125 

gaa ggt gtt gaa tat ggt att agt tat act aca tgg aat ttc aac tgt 432 
Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 

130 135 140 

cct gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 4 80 

35 Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
145 150 155 160 

acc att att aga aaa cat tta aca cat att tct caa get tat tta aca 528 
Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 
165 170 175 

40 gtt aat aca aaa gtt gtt ttc aac tgt aca ggt att ggt get get gat 57 6 

Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly He Gly Ala Ala Asp 

180 185 190 

tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga- gga caa gtt 624 
Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 

45 195- 200 205 

gtt gtt gtt aga get cca cat att caa gaa aat aaa atg aga tgg ggt 672 
Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 
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210 215 220 

aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 720 
Lys Asp Tyr Ala Thr Tyr lie He Pro Arg Pro* Tyr Ser Asn Gly Glu 
225 230 235 240 

5 tta gtc tta ggt ggt ttc tta caa aag gat aat tgg aca ggt aat act 7 68 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 816 
Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 
10 260 265 270 

cca aag att tta gat gaa cca ctt cat att att aga gtt gca get ggt 864 
Pro Lys He Leu Asp Glu Pro Leu His lie He Arg Val Ala Ala Gly 

275 280 285 

tta aga cca agt aga cat ggt ggt cca aga att gaa get gaa gtt tgt 912 
15 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 960 
Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

20 tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 1008 
Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

gat aac caa aaa gtt* aaa get aaa ctt tag 1038 
Asp Asn Gin Lys Val Lys Ala Lys Leu 
25 340 345 

<210> 14 

<211> 345 

<212> PRT 

30 <213> Candida boidinii 

<400> 14 

Met Gly Asp Gin He Val Val Leu Gly Ser Gly He He Gly Leu Tyr 
1-5 10 15 



35 



Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 
20 25 30 



Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
40 35 40 45 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 
50 55 60 

45 Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He 
65 70 75 80 
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His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 
85 90 95 

Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 
5 100- 105 110 

Lys Gin Tyr Leu Lys Asp Phe Lys Val lie Pro Lys Ser Glu Leu Pro 
115 120 125 

10 Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
145 150 155 160 

15 

Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 
165 170 175 

Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly He Gly Ala Ala Asp 
20 180 185 190 

Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

25 Val Val Val Arg Ala Pro His He Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 

Lys Asp Tyr Ala Thr Tyr He He Pro Arg Pro Tyr Ser Asn Gly Glu 
225 230 235 240 

30 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 
245 250 255 

Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 
35 260 265 270 

Pro Lys He Leu Asp Glu Pro Leu His He He Arg Val Ala Ala Gly 
275 280 285 

40 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
"305 310 315 320' 

45 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 
325 330 335 
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Asp Asn Gin Lys Val Lys Ala Lys Leu 
340 345 

<210> 15 
<211> 12334 
<212> DNA 

<213> STPT GUS Nit-P daao 



10 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(38) . . (183) 

Agrobacterium Right Border 



15 



<220> 
<221> 
<222> 
<223> 



terminator 
(384) . . (639) 

complementary: NOS terminator 



20 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(716) . - (1822) 
complementary : 



coding for Rhodotorula gracilis DAAO 



25 



<220> 
<221> 
<222> 
<223> 



promoter 
(1866) . . (3677) 

complementary: A.thaliana nitrilase I promoter 



30 



<220> 
<221> 
<222> 
<223> 



terminator 
(3767) . . (3971) 

complementary: 35S terminator 



35 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(4046) . . (6043) 
complementary: 



coding for beta-glucuronidase (comprising intron) 



40 



<220> 
<221> 
<222> 
<223> 



promoter 
(6097) . . (7414) 

complementary: sTPT promoter 



45 



<220> 
<221> 



misc feature 



WO 2005/090582 PCT/EP2005/002735 

25/34 

<222> (7486) . . (7702) 

<223> Agrobacterium Left Border 

<400> 15 

5 aatattcaaa caaacacata cagcgcgact tatcatggac atacaaatgg acgaacggat 60 

aaaccttttc acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg 120 

tttacccgcc aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca 180 

atcagatctg gtaccttgcg ccgggtaccc caaactgtct cacgacgttt tgaacccaga 240 

ttaccctgtt atccctagtc gagcggccgc cagtgtgatg gatatctgca gaattcgccc 300 

10 ttttagatca gcacactggc ggccgttact agtggatcaa ttcactggcc gtcgttttac 360 

aacgactcag agcttgacag gaggcccgat ctagtaacat agatgacacc gcgcgcgata 420 

atttatccta gtttgcgcgc tatattttgt tttctatcgc gtattaaatg tataattgcg 4 80 

ggactctaat cataaaaacc catctcataa ataacgtcat gcattacatg ttaattatta 540 

catgcttaac gtaattcaac agaaattata tgataatcat cgcaagaccg gcaacaggat 600 

15 tcaatcttaa gaaactttat tgccaaatgt ttgaacgatc ggggatcatc cgggtctgtg 660 

gcgggaactc cacgaaaata tccgaacgca gcaagatcta gagcttgggt cccgcctaca 720 

acttcgactc ccgcgccgcg ccgtggtacc gctggaacgc ctcgtcgacg agctgcgcga 780 

catcctccgc cgcgccccaa ctctgctggt atcccgcact cgagaagcca tacgcatgca 840 

caagcgtgac ctccttctcc ttcgccgctc gtgcgctgcc cctgccgagc gagaggggcg 900 

20 actttgtccg gtcgagaggc aggacgatcc gttctgcctc aacgcgg.ggt ccgcctcgtc 960 

gtgcaggtcg caagccgacg t.tgtggcgga ggacctcgat gccttcgatc ' gttccgtcgc 1020 

tcgagatggt cgggtcgagg cgcaagcagt gcttgaggat ccgctggacc gtctctgggt 1080 

tgacagacaa gtcccagtct cccacgccgt acgtcccgcc gcagatgact tcgccacctg 1140 

gtcggggaat gatgtaggcg ggagaagcgg ggtcggacga gtccatcgtg catcgcttgc 1200 

25 atggggactt gacgaggacg gtttgcccgc ggattggctc ggcggcttgg tcgtcgatgc 1260. 

ccgcaatcga cttggcgcca agtcccgtag cgttgaccac caaatccgca ccgtcgaacg 1320 

cctgctcaag cgacgtaacg gtccgtctct caaacgtcgc gccgagcttc tgcagctctc 1380 

ttgcaaggta ctggcagtac tttggtgcgt ggacggagag ggtgtcgtag gttacgccga 144 0 

tagcgccagg tggacattcg gaagatggga gggggcggta atttggcgtg atgtccttgt 1500 

30 accagtgccc gagcaagccg tcttcgttct gcgcgaaccg cctcgtcccc ttgagccaca 1560 

tggcatggcc cgtcgggacc aactcgaccc acttcttgaa agtcgattct tcccattttg 1620 

cttgtcgagg accgtctgta agcgtcatga aaggcgtcca attcgcgcca gcccatggtg 1680 

aagcgaaagt ctggctcgag acgtcctccg gcaagtcgcg cgcgagaata tgcacgctgt 1740 

agcccttccg agcgaggatg agggcgctgc tcagaccgat aacgcctgat ccgaggacaa 1800 

35 cgacgcgctt ctgcgagtgc atgggccctc gactagagtc gagatccgat atcgcccggg 18 60 

.ctcgagtctt tgttttttac tttggttcat gacactcaga gacttgagag aagcaatata 1920 

tagacttttt tttgtttttt ttttgtggtc acgtttattt tcctattgga gacggtaacg 1980 

aagatcgaac ctgtggtgga aatgaaacaa ggtgggacta gcccacgtgg tttcttttct 2040 

ctgcattgat ttgtttttgt tttttttgta aagttcacat caaacctact aataattgag 2100 

40 aagaaaaata aaatctattg attgattaaa ccagccgatg ctttatgtct gaatataaaa 2160 

aagaagtgaa aaccccgttt aagaattaca acggtggttt acaaagtatt tggacacaat 2220 

aaatccaaac gaaataaaac aaaatggaga actaccaaat aaaaaacaaa taaaaaactt 2280 

aaaagaattt attccatttt ttttcccgta gaatttattc ttttatggat tccttaaatc 2340 

catattfcgat gcattttgat tcctcataat aggtaataat atafcactatg ttatagatat '2400 

45 gtttctaatt cgtattaacc tacctttttt tggtcgtacg attctaccta ataatattga 24 60 

acggaattga tgttttggac cacttagaaa gtattttttt tttggtttgt cttagctgta 2520 

tttcattaaa tataaattta aataagaaat gtcataaata aaatttgacg tatagatttt 2580 
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ttaaatccat tttatgttat ttaatatttg 
ggatgggtta atactaagtt ttccttaatg 
ctaaaaaata gatcaatgtg ttttggtccg 
cgaaaagaaa gtctcatata gtaaatcata 
5 taacatggat tgtatagttt cttatataac 
caattaataa tcaaacgggc ttggtaggaa 
aaaacacaca ctcaagtctt ttatttcatc 
ttgccacttt tctttacaaa taaacacaaa 
caaacactgt ttgtcttttt tcgatcacgt 

10 gggaatgtaa aacaacaaca tgtgaagtca 
ttttcgttgt caatattagt cattccaaaa 
atattcttgg aactaatgta tgtggaaata' 
caaatatgca aagaaaaaaa agaaaagaac 
attggctaca tggttgtaga atgtagggta 

15 ggatgataga tggataatct ttagatacaa 
aataaagaaa aaaagtttag cataaaagtt 
tccttcatag gagtgacatg aactgacgaa 
ccgctttctc caccttcgaa atgaatgtga 
agtgtaaaac atctggtctc gaggtacctg 

20 acgtcgtgag acagtttggt gcaggtcgaa 
tggttttagg aattagaaat tttattgata. 
agggtttctt atatgctcaa cacatgagcg 
tgggaactac tcacacatta .ttctggagaa 
actggtgatt tttgcgccgg gtaccgagct 

25 atggtgcgcc aggagagttg ttgattcatt 
agttcatgcc agtccagcgt ttttgcagca 
gagtgaagat ccctttcttg ttaccgccaa- 
cggcgaaatt ccatacctgt tcaccgacga 
acatatccag ccatgcacac tgatactctt 

30 gcccggctaa cgtatccacg ccgtat'tcgg 
gcc'aggccag aagttctttt tccagtacct 
cataccatcc gtaataacgg ttcaggcaca 
tgtgagcgtc gcagaacatt acattgacgc 
gcgttgcttc cgccagtggc gaaatattcc 

35 tggcaatact ccacatcacc acgcttgggt 
tcgcctgtaa gtgcgcttgc tgagtttccc 
tcggcttgtt gcccgcttcg aaaccaatgc 
tttcatcaat caccacgatg ccatgttcat 
ggtaatgcga ggtacggtag gagttggccc 

40 ccatcagcac gttatcgaat cctttgccac 
cagtaaagta gaacggtttg tggttaatca 
- ggatgccgac gcgaagcggg tagatatcac 
gttcatagag ataaccttca cccggttgcc 
cgctagt'gcc ttgtccagtt gcaaccacct 

45 catcaccatt ggccaccacc tgccagtcaa 
gcgtcaccac ggtgatatcg tccacccagg 
gatggattcc ggcatagtta aagaaatcat 
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aaatgtgagt ttggctccta tttaatctta 2640 

aattatctca gagaaactgg attaaataaa 2700 

gtcaaatatc tttggattta ctattattgg 2760 

ttcctacaag agaaatcaaa atttttgaat 2820 

caattagttc gcatcaagaa aaccaaaccc 2880 

tatttcattg cagctttcag ataaaagaaa 2940 

tttcttactt gcaggaactc aaattccact 3000 

ttgtcaatga aacgaaatag tctttttatg 3060 

ttctgattgt gacagccatc catatatata 3120 

catatacgta atggtttagc atagcttcta 3180 

catttttaag aaaaataaat taatatatgt 3240 

cagtaactta attattaaac attctaaatg 3300 

acaactgaaa tcaaagccag attcataata 3360 

acacaacatc cagaattgaa cactcaaatt 3420 

gagaattggt tctcttccat tattaacgaa 34 80 

tgaaactcaa cataacattt tgaacttgac 3540 

tcacaaccga ttacttgttt gagtcatctt 3600 

ccggtttctt cgggtgctca tttacggtca 3660 

gtagggataa cagggtaatc tgggttcaaa 3720 

attcgagctc ggtacccggt cactggattt 37 80 

gaagtatttt acaaatacaa atacatacta 3840 

.aaaccctata agaaccctaa ttcccttatc 3900 

aaatagagag agatagattt gtagagagag 3960 

cggtagcaat tcccgaggct gtagccgacg 4020 

gtttgcctcc ctgctgcggt ttttcaccga 4080 

gaaaagccgc cgacttcggt ttgcggtcgc 4140 

cgcgcaatat gccttgcgag gtcgcaaaat 4200 

cggcgctgac gcgatcaaag acgcggtgat 4260 

cactccacat gtcggtgtac attgagtgca 4320 

tgatgataat cggctgatgc agtttctcct 438 0 

tctctgccgt ttccaaatcg ccgctttgga 4 440 

gcacatcaaa gagatcgctg atggtatcgg 4500 

aggtgatcgg acgcgtcggg tcgagtttac 4560 

cgtgcacttg cggacgggta tccggttcgt 4 620 

ggtttttgtc acgcgctatc agctctttaa 4680 

cgttgactgc ctcttcgctg tacagttctt 4740 

ctaaagagag gttaaagccg acagcagcag 4800 

ctgcccagtc gagcatctct tcagcgtaag 4 8 60 

caatccagtc cattaatgcg tggtcgtgca 4 920 

gtaagtccgc atcttcatga cgaccaaagc 4 980 

ggaactgttc gcccttcact gccactgacc 504 0 

actctgtctg gcttttggct gtgacgcaca 5100 

agaggtgcgg attcaccact tgcaaagtcc 5160 

gttgatccgc atcacgcagt tcaacgctga 5220 

cagacgcgtg gttacagtct tgcgcgacat 5280 

tgttcggcgt ggtgtagagc attacgctgc 534 0 

ggaagtaaga ctgctttttc ttgccgtttt 54 00 
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cgtcggtaat caccattccc ggcgggatag tctgccagtt cagttcgttg ttcacacaaa 54 60 

cggtgatacc tgcacatcaa caaattttgg tcatatatta gaaaagttat aaattaaaat 5520 

atacacactt ataaactaca gaaaagcaat tgctatatac tacattcttt tattttgaaa 5580 

aaaatatttg aaatattata ttactactaa ttaatgataa ttattatata tatatcaaag 5640 

5 gtagaagcag aaacttacgt acacttttcc cggcaataac atacggcgtg acatcggctt 5700 

caaatggcgt atagccgccc tgatgctcca tcacttcctg attattgacc cacactttgc 57 60 

cgtaatgagt gaccgcatcg aaacgcagca cgatacgctg gcctgcccaa cctttcggta 5820 

taaagacttc gcgctgatac cagacgttgc ccgcataatt acgaatatct gcatcggcga 5880 

actgatcgtt aaaactgcct ggcacagcaa ttgcccggct ttcttgtaac gcgctttccc 5940 

10 accaacgctg accaattcca cagttttcgc gatccagact gaatgcccac aggccgtcga 6000 

gttttttgat ttcacgggtt ggggtttcta caggacgtac catrrgtcgat cgactctaga 6060 

ctagtggatc cgatatcgcc cgggctcgac tctagatgaa atcgaaattc agagttttga 6120 

tagtgagagc aaagagggac ggacttatga ggatttcgag tatttcaaga gatggtactt 6180 

gttgatcgga cggctacgat gatctcgatt tggttaatcc agtatctcgc ggtgtatgga 6240 

15 gttatggtag ggttaatggt caatttcatc taacggtaga gaatgatgta attagataag 6300 

aatcttgaga tactggttta gattggatga gtgtagggtc catcttatct tgataagtgg 6360 

atggttttta gagacacagt gaatattagc caatcgaagt tccatatcac catcatcatc 6420 

tgtataattt tgtttttttg gaagataata atgattgaaa ttttggtaga ttttattttt 64 80 

cattatttac cttgtatgtt gagtggtctt caaattattg aacgtgacag attcacaaga 6540 

20 aagtagattt tttataaatg aaattttact tattttaaag gtatctctat ttaatttctt 6600 

ttgtttatgg ttgtctgtca gcatttgact tgcagtttca tgctcatagt catatacgtt 6660 

attctaggct tttttgaata tcttattact tttttcgtaa tacaatttta taattttatc 6720 

aaagttatac aactataact aaaattaggg ttttctacaa aacaaaaaaa tcttctaatt 6780 

ttttttgttg tagccagttt actcgtaagt tacaaaaaaa tacaaatgaa cccacatgta 684 0 

25 ttatgcgttt aactaggatt accatgtact ttcatgtact caattcaccc tatactcttt 6900 

tttttttttt ttctagttcc acccaatcta taaaattctg tccatttgac caaattcaat 6960 

taatttctgt aattgcgatt taaaattaat attacatgtt cactat-ttct cgatttgagg 7 020 

gaacccgagt ttaaatatga taaaaatgtt gacccatcac tacaaatatg ttatagttta 7 080 

tacttaatag tggtgttttt ggggataatt- gatgaattaa gtaaacatga' ttcttcttat 714 0 

30 gaagttgatt gagtgattat tgtatgtaaa cctatgtgat tgatgttatt ggttgattga 7200 

gtgattattg tattagtatg taagcaaaga tgattgttct tatgaggtaa tttgttactc 72 60 

attcatcctt ttgcatatga gaaattgtgt tagcgtacgc aaaacaatag agaacataaa 7320 

agatatgtgt atttatttaa ggtgactttt gttaatgata ttgtagtatc tatacattta 7380 

tatataactt gttgaatttg agtataagct atcaggatcc gggggatcct ctagagtcga 74 4 0 

35 ggtaccgagc tcgaattcac tggccgtcgt tttacaacga ctcagtactg cttggtaata 7500 

attgtcatta gattgttttt atgcatagat gcactcgaaa tcagccaatt ttagacaagt 7560 

atcaaacgga tgttaattca gtacattaaa gacgtccgca atgtgttatt aagttgtcta 7 620 

agcgtcaatt tgtttacacc acaatatatc ctgccaccag ccagccaaca gctccccgac 7 68 0 

cggcagctcg gcacaaaatc accacgcgtc taaaaaggtg atgtgtattt gagtaaaaca 77 40 

40 gcttgcgtca tgcggtcgct gcgtatatga tgcgatgagt aaataaacaa atacgcaagg 7 800 

ggaacgcatg aaggttatcg ctgtacttaa ccagaaaggc gggtcaggca agacgaccat 78 60 

cgcaacccat ctagcccgcg ccctgcaact cgccggggcc gatgttctgt tagtcgattc 7920 

cgatccccag ggcagtgccc gcgattgggc ggccgtgcgg gaagatcaac cgctaaccgt 7980 

tgtcggcatc gaccgcccga cgattgaccg cgacgtgaag gccatcggcc ggcgcgactt 8 040 

45 cgtagtgatc gacggagcgc cccaggcggc ggacttggct gtgtccgcga tcaaggcagc 8100 

cgacttcgtg ctgattccgg tgcagccaag cccttacgac atatgggcca ccgccgacct 8160 

ggtggagctg gttaagcagc gcattgaggt cacggatgga aggctacaag cggcctttgt 8220 
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cgtgtcgcgg gcgatcaaag gcacgcgcat 
gtacgagctg cccattcttg agtcccgtat 
cgccgccggc acaaccgttc ttgaatcaga 
ggcgctggcc gctgaaatta aatcaaaact 
5 agcaaaagca caaacacgct aagtgccggc 
acgttggcca gcctggcaga cacgccagcc 
gaggatcaca ccaagctgaa gatgtacgcg 
ctatctgaat acatcgcgca gctaccagag 
aattttagcg gctaaaggag gcggcatgga 

10 ggaatgcccc atgtgtggag gaacgggcgg 
ccggccctgc aatggcactg gaacccccaa 
ccatccggcc cggtacaaat cggcgcggcg 
gccgcgcagg ccgcccagcg gcaa.cgcatc 
aaggggccgc tgatcgaatc cgcaaagaat 

15 cgattaggaa gccgcccaag ggcgacgagc 
acgtgggcac ccgcgatagt cgcagcatca 
gtgaccgacg agctggcgag gtgatccgct 
ccgcagggcc ggccggcatg gccagtgtgt 
cccatctaac cgaatccatg aaccgatacc 

20 tgttccgtcc acacgttgcg gacgtactca 
agaaagacga cctggtagaa acctgcattc 
gtacgaagaa ggccaagaac ggccgcctgg 
gccgctacaa gatcgtaaag agcgaaaccg 
ctgattggat gtaccgcgag atcacagaag 

25 ccgattactt tttgatcgat cccggcatcg 
ccgcaggcaa ggcagaagcc , agatggttgt 
ccggagagtt caagaagttc tgtttcaccg 
cggagtacga tttgaaggag gaggcggggc 
gcaacctgat cgagggcgaa gcatccgccg 

30 aaattgccct agcaggggaa aaaggtcgaa 
ttgggaaccc aaagccgtac attgggaacc 
acattgggaa ccggtcacac atgtaagtga 
ccgcctaaaa ctctttaaaa cttattaaaa 
tgtctggcca gcgcacagcc gaagagctgc 

35 ccctacgccc cgccgcttcg cgtcggccta 
agatgcgtaa ggagaaaata ccgcatcagg 
ctgcgctcgg tcgttcggct gcggcgagcg 
ttatccacag aatcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 

40 gagcatcaca aaaatcgacg ctcaagtcag 
taccaggcgt ttccccctgg aagctccctc 
accggatacc tgtccgcctt tctcccttcg 
tgtaggtatc tcagttcggt gtaggtcgtt 
cccgttcagc* ccgaccgctg cgccttatcc 

45 agacacgact tatcgccact ggcagcagcc 
gtaggcggtg ctacagagtt cttgaagtgg 
gtatttggta tctgcgctct gctgaagcca 



cggcggtgag 


gttgccgagg 


cgctggccgg 


8280 


cacgcagcgc 


gtgagctacc 


caggcactgc 


8340 


acccgagggc 


gacgctgccc 


gcgaggtcca 


8400 


catttgagtt 


aatgaggtaa 


agagaaaatg 


8460 


cgtccgagcg 


cacgcagcag 


caaggctgca 


8520 


atgaagcggg 


tcaactttca 


gttgccggcg 


8580 


gtacgccaag 


gcaagaccat 


taccgagctg 


8640 


taaatgagca 


aatgaataaa 


tgagtagatg 


8700 


aaatcaagaa 


caaccaggca 


ccgacgccgt 


8760 


ttggccaggc 


gtaagcggct 


gggttgtctg 


8820 


gcccgaggaa 


tcggcgtgag 


cggtcgcaaa 


8880 


ctgggtgatg 


acctggtgga 


gaagttgaag 


8940 


gaggcagaag 


acgccccggt 


gaatcgtggc 


9000 


cccggcaacc 


gccggcagcc 


ggtgcgccgt 


9060 


aaccagattt 


tttcgttccg 


atgctctatg 


9120 


tggacgtggc 


cgttttccgt 


ctgtcgaagc 


9180 


acgagcttcc 


agacgggcac 


gtagaggttt 


9240 


gggattacga 


cctggtactg 


atggcggttt 


9300 


gggaagggaa 


gggagacaag 


cccggccgcg 


9360 


agttctgccg 


gcgagccgat 


ggcggaaagc 


9420 


ggttaaacac 


cacgcacgtt 


gccatgcagc 


9480 


tgacggtatc 


cgagggtgaa 


gccttgatta 


9540 


ggcggccgga 


gtacatcgag 


atcgagcta'g 


9600 


gcaagaaccc 


ggacgtgctg 


acggttcacc 


9660 


gccgttttct 


ctaccgcctg 


gcacgccgcg 


9720 


tcaagacgat 


ctacgaacgc 


agtggcagcg 


9780 


tgcgcaagct 


gatcgggtca 


aatgacctgc 


9840 


aggctggccc 


gatcctagtc 


atgcgctacc 


9900 


gttcctaatg 


tacggagcag 


atgctagggc 


9960 


aaggtctctt 


tcctgtggat 


agcacgtaca 


10020 


ggaacccgta 


cattgggaac 


ccaaagccgt 


10080 


ctgatataaa 


agagaaaaaa 


ggcgattttt • 


10140 


ctcttaaaac 


ccgcctggcc 


tgtgcataac 


10200 


aaaaagcgcc 


tacccttcgg 


tcgctgcgct 


10260 


tcgcggccta 


tgcggtgtga 


aataccgcac 


10320 


cgctcttccg 


cttcctcgct 


cactgactcg 


10380 


gtatcagctc 


actcaaaggc 


ggtaatacgg 


10440 


aagaacatgt 


gagcaaaagg 


ccagcaaaag 


10500 


gcgtttttcc 


ataggctccg 


cccccctgac 


10560 


aggtggcgaa 


acccgacagg 


actataaaga 


10620 


gtgcgctctc 


ctgttccgac 


cctgccgctt 


10680 


ggaagcgtgg 


cgctttctca 


tagctcacgc 


10740 


cgctccaagc 


tgggctgtgt 


gcacgaaccc 


10800 


ggtaactatc 


gtcttgagtc 


caacccggta 


10860' 


actggtaaca 


ggattagcag 


agcgaggtat 


10920 


tggcctaact 


acggctacac 


tagaaggaca 


10980 


gttaccttcg 


gaaaaagagt 


tggtagctct 


11040 
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tgatccggca aacaaaccac cgctggtagc 
acgcgcagaa aaaaaggatc tcaagaagat 
cagtggaacg aaaactcacg ttaagggatt 
tgtagggctt attatgcacg cttaaaaata 
5 • tgtgagcaat tatgtgctta gtgcatctaa 
gtcggcttga acgaatttct agctagacat 
ttcacgtagt ggacaaattc ttccaactga 
tgtccaagat aagcctgtct agcttcaagt 
tccattgccc agtcggcagc gacatccttc 

10 caaatgcggg acaacgtaag cactacattt 
ttccatagcg ttaaggtttc atttagcgcc 
aagagttcct ccgccgctgg acctaccaag 
aagatagcca gatcaatgtc gatcgtggct 
cgctgccatt ctccaaattg cagttcgcgc 

15 tcgtgcacaa caatggtgac ttctacagcg 
gaagtttcca aaaggtcgtt gatcaaagct 
accgtaacca gcaaatcaat * atcactgtgt 
tacaaatgta cggccagcaa cgtcggttcg 
gatagttgag tcgatacttc ggcgatcacc 

20 gggcgactgc cctgctgcgt aacatcgttg 
gcgtaacgcg cttgctgctt ggatgcccga 
taacaagcca tgaaaaccgc cactgcgttc 



ggtggttttt ttgtttgcaa gcagcagatt 11100 

cctttgatct tttctacggg gtctgacgct 11160 

ttggtcatgc atgatatatc tcccaatttg 11220 

ataaaagcag acttgacctg atagtttggc 11280 

cgcttgagtt aagccgcgcc gcgaagcggc 11340 

tatttgccga ctaccttggt gatctcgcct 11400 

tctgcgcgcg aggccaagcg atcttcttct 114 60 

atgacgggct gatactgggc cggcaggcgc 11520 

ggcgcgattt tgccggttac tgcgctgtac 11580 

cgctcatcgc cagcccagtc gggcggcgag 11640 

tcaaatagat cctgttcagg aaccggatca 11700 

gcaacgctat gttctcttgc ttttgtcagc 117 60 

ggctcgaaga tacctgcaag aatgtcattg 11820 

ttagctggat aacgccacgg aatgatgtcg 118 80 

cggagaatct cgctctctcc aggggaagcc 11940 

cgccgcgttg tttcatcaag ccttacggtc 12000 

ggcttcaggc cgccatccac tgcggagccg 12060 

agatggcgct cgatgacgcc- aactacctct 12120 

gcttccccca tgatgtttaa' ctttgtttta 121.80 

ctgctccata acatcaaaca tcgacccacg 12240 

ggcatagact gtaccccaaa aaaacagtca 12300 

catg ■ 12334 



25 <210> 16 

<211> 11385 

<212> DNA 

<213> STPT GUS ptx-P daao 



30 



<220> 
<221> 
<222> 
<223> 



.misc__feature 
(38) . . (183) 

Agrobacterium Right Border 



35 <220> 

<221> terminator 

<222> (384).. (639) 

<223> complementary: nos terminator 



40 <220> 

<221> misc_feature 

<222> (716) . . (1822) 

<223> complementary: coding for Rhodotorula gracilis DAAO 



45 <220> 
<221> 



promoter 



« 
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<222> (1866) . . (2728) 

<223> complementary: ptxA promoter 
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<220> 
<221> 
<222> 
<223> 



terminator 
(2818) . . (3022) 
complementary : 



35S terminator 



10 



<220> 
<221> 
<222> 
<223> 



miscjf eature 
(3097) . . (5094) 
complementary : 



coding for beta-glucuronidase (comprising intron) 



15 



<220> 
<221> 
<222> 
<223> 



promoter 
(5148) . . (6465) 

complementary: sTPT promoter 



20 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(6537) . . (6753) 
Agrobacterium .Left Border 



<400> 16 

25 aatattcaaa caaacacata cagcgcgact 
aaaccttttc acgccctttt aaatatccga 
tttacccgcc aatatatcct gtcaaacact 
atcagatctg gtaccttgcg ccgggtaccc 
ttaccctgtt atccctagtc gagcggccgc 

30 ttttagatca gcacactggc ggccgttact 
aacgactcag agcttgacag gaggcccgat 
atttatccta gtttgcgcgc tatattttgt. 
ggactctaat cataaaaacc catctcataa 
catgcttaac gtaattcaac agaaattata 

35 tcaatcttaa gaaactttat tgccaaatgt 
gcgggaactc cacgaaaata tccgaacgca 
acttcgactc ccgcgccgcg ccgtggtacc 
catcctccgc cgcgccccaa ctctgctggt 
caagcgtgac ctccttctcc ttcgccgctc 

40 actttgtccg gtcgagaggc aggacgatcc 
gtgcaggtcg caagccgacg ttgtggcgga 
tcgagatggt cgggtcgagg cgcaagcagt 
tgacagacaa gtcccagtct cccacgccgt 
gtcggggaat gatgtaggcg ggagaagcgg 

45 atggggactt gacgaggacg gtttgcccgc 
ccgcaatcga cttggcgcca agtcccgtag 
cctgctcaag cgacgtaacg gtccgtctct 



tatcatggac 


atacaaatgg 


acgaacggat 


60 


ttattctaat 


aaacgctctt 


ttctcttagg 


120 


gatagtttaa 


actgaaggcg 


ggaaacgaca 


180 


caaactgtct 


cacgacgttt 


tgaacccaga 


240 


cagtgtgatg 


gatatctgca 


gaattcgccc 


300 


agtggatcaa 


ttcactggcc 


gtcgttttac 


360 


ctagtaacat 


agatgacacc 


gcgcgcgata 


420 


tttctatcgc 


gtattaaatg 


tataattgcg 


480 


ataacgtcat 


gcattacatg 


ttaattatta 


540 


tgataatcat 


cgcaagaccg 


gcaacaggat 


600 


ttgaacgatc 


ggggatcatc 


cgggtctgtg 


660 


gcaagatcta 


gagcttgggt 


cccgcctaca 


720 


gctggaacgc 


ctcgtcgacg 


agctgcgcga 


780 


atcccgcact 


cgagaagcca 


tacgcatgca 


840 


gtgcgctgcc 


cctgccgagc 


gagaggggcg 


900 


gttctgcctc 


aacgcggggt 


ccgcctcgtc 


960 


ggacctcgat 


gccttcgatc 


gttccgtcgc 


1020 


gcttgaggat 


ccgctggacc 


gtctctgggt 


1080 


acgtcccgcc 


gcagatgact 


tcgccacctg 


1140 


ggtcggacga 


gtccatcgtg 


catcgcttgc 


1200 


ggattggctc 


ggcggcttgg 


tcgtcgatgc 


1260 


cgttgaccac 


caaatccgca 


ccgtcgaacg 


1320 


caaacgtcgc 


gccgagcttc 


tgcagctctc 


1380 
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ttgcaaggta ctggcagtac tttggtgcgt ggacggagag ggtgtcgtag gttacgccga 14 40 

tagcgccagg tggacattcg gaagatggga gggggcggta atttggcgtg atgtccttgt 1500 

accagtgccc gagcaagccg tcttcgttct gcgcgaaccg cctcgtcccc ttgagccaca 1560 

tggcatggcc cgtcgggacc aactcgaccc acttcttgaa agtcgattct tcccattttg 1620 

5 cttgtcgagg accgtctgta agcgtcatga aaggcgtcca attcgcgcca gcccatggtg 1680 

aagcgaaagt ctggctcgag acgtcctccg gcaagtcgcg cgcgagaata tgcacgctgt 1740 

agcccttccg agcgaggatg agggcgctgc tcagaccgat aacgcctgat ccgaggacaa 1800 

cgacgcgctt ctgcgagtgc atgggccctc gactagagtc gagatccgat atcgcccggg 1860 

ctcgagtttc gaagatttta gtgtaatgtg tgtgctcact actatgaagc tttgcactta 1920 

10 aaaaaataga atgagtgatg aggtttatat ggtgaaaaaa actatgaaat tttgatattt 1980 

tgatatatct ttctcgtgag tcatattcac ggaccatgtt gcagcaaatt ggaattaaac 2040 

tattcatttt ttatgttaaa tcattgattg atttttagtg ggcctcgtta catattcaag 2100 

agttagaatg aattcaaaca aactaggcca gaaaaaagga tgtggggcca tttttttgtg 2160 

tcttaagaat ttgtttattt ttttcatgga taaggggaat. caatggaaaa agtttgatgt 2220 

15 actagaggac atttttttaa catgtagtga caagtagtgc tattattcga cccgtgatga 2280 

aaggggcaat cttaatcttt ttttcataaa tctgcacatg tgatgcttta attatgcttt 2340 

agactttgtg ctaaactatt ggtaatttct ttttgtaatc gaatcaagta tcttttaaac 2400 

tatgtatgaa atgtgtcatc ctaaaaacaa cattttgcta gttttagact ttgatgttta 24 60 

tatgcttaat ggaagaagca atatgttgat gtttattggg taaaagaaag ggacttgatt 2520 

20 gagtatgtaa ttgacaacta tgattttata ttggatttga tattcctaac attaatttaa 2580 

gtgtgtgggt ttcaaagcat gttatgctag tgattcttgt gtttgatgct tgaaaaatct .2640 

acattcatcc ttgaatggag ggacaaactt .tgaatgactt ttgaataggt gtaaaatcca 2700 

atcctccctc agcttcacaa aaaattgcct cgaggtacct ggtagggata acagggtaat 27 60 

ctgggttcaa aacgtcgtga gacagtttgg tgcaggtcga aattcgagct cggtacccgg 2820 

25 tcactggatt ttggttttag gaattagaaa ttttattgat agaagtattt tacaaataca 2880 

aatacatact aagggtttct tatatgctca acacatgagc gaaaccctat aagaacccta 2940 

attcccttat ctgggaacta ctcacacatt attctggaga aaaatagaga gagatagatt 3000 

tgtagagaga gactggtgat ttttgcgccg ggtaccgagc tcggtagcaa ttcccgaggc 3060 

tgtagccgac gatggtgcgc caggagagtt gttgattcat tgtttgcctc cctgctgcgg 3120 

30 tttttcaccg aagttcatgc cagtccagcg tttttgcagc agaaaagccg ccgacttcgg 3180 

tttgcggtcg cgagtgaaga tccctttctt gttaccgcca acgcgcaata tgccttgcga 3240 

ggtcgcaaaa tcggcgaaat tccatacctg ttcaccgacg acggcgctga cgcgatcaaa 3300 

gacgcggtga tacatatcca gccatgcaca ctgatactct tcactccaca tgtcggtgta 3360 

cattgagtgc agcccggcta acgtatccac gccgtattcg gtgatgataa tcggctgatg 3420 

35 cagtttctcc tgccaggcca gaagttcttt ttccagtacc ttctctgccg tttccaaatc 3480 

gccgctttgg acataccatc cgtaataacg gttcaggcac agcacatcaa agagatcgct 3540 

gatggtatcg gtgtgagcgt cgcagaacat tacattgacg caggtgatcg gacgcgtcgg 3600 

gtcgagttta cgcgttgc£t ccgccagtgg cgaaatattc ccgtgcactt gcggacgggt 3660 

atccggttcg ttggcaatac tccacatcac cacgcttggg tggtttttgt cacgcgctat 3720 

40 cagctcttta atcgcctgta agtgcgcttg ctgagtttcc ccgttgactg cctcttcgct 3780 

gtacagttct ttcggcttgt tgcccgcttc gaaaccaatg cctaaagaga ggttaaagcc 3840 

gacagcagca gtttcatcaa tcaccacgat gccatgttca tctgcccagt cgagcatctc 3900 

ttcagcgtaa gggtaatgcg aggtacggta ggagttggcc ccaatccagt ccattaatgc 3960 

gtggtcgtgc accatcagca cgttatcgaa tcctttgcca cgtaagtccg catcttcatg 4020 

45 acgaccaaag ccagtaaagt agaacggttt gtggttaatc aggaactgtt cgcccttcac 4080 

tgccactgac cggatgccga cgcgaagcgg gtagatatca cactctgtct ggcttttggc 4140 

tgtgacgcac agttcataga gataaccttc acccggttgc cagaggtgcg gattcaccac 4200 
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ttgcaaagtc ccgctagtgc cttgtccagt 
ttcaacgctg acatcaccat tggccaccac 
ttgcgcgaca tgcgtcacca cggtgatatc 
cattacgctg cgatggattc cggcatagtt 
5 cttgccgttt tcgtcggtaa tcaccattcc 
gttcacacaa acggtgatac ctgcacatca 
taaattaaaa tatacacact tataaactac 
ttattttgaa aaaaatattt gaaatattat 
atatatcaaa ggtagaagca gaaacttacg 

10 gacatcggct tcaaatggcg tatagccgcc 
ccacactttg ccgtaatgag tgaccgcatc 
acctttcggt ataaagactt cgcgctgata 
tgcatcggcg aactgatcgt taaaactgcc 
cgcgctttcc caccaacgct gaccaattcc 

15 caggccgtcg agttttttga tttcacgggt 
tcgactctag actagtggat ccgatatcgc 
cagagttttg atagtgagag caaagaggga 
agatggtact tgttgatcgg acggctacga 
cggtgtatgg agttatggta gggttaatgg 

20 aattagataa gaatcttgag atactggttt 
ttgataagtg gatggttttt agagacacag 
ccatcatcat ctgtataatt ttgttttttt 
attttatttt tcattattta ccttgtatgt 
gattcacaag aaagtagatt ttttataaat 

25 tttaatttct tttgtttatg gttgtctgtc 
tcatatacgt tattctaggc ttttttgaat 
ataattttat caaagttata caactataac 
atcttctaat' tttttttgtt gtagccagtt 
acccacatgt attatgcgtt taactaggat 

30 ctatactctt tttttttttt tttctagttc 
ccaaattcaa ttaatttctg taattgcgat 
tcgatttgag ggaacccgag tttaaatatg 
gttatagttt atacttaata gtggtgtttt 
.attcttctta tgaagttgat tgagtgatta 

35 tggttgattg agtgattatt gtattagtat 
atttgttact cattcatcct tttgcatatg 
gagaacataa aagatatgtg tatttattta 
ctatacattt atatataact tgttgaattt 
tctagagtcg aggtaccgag ctcgaattca 

40 gcttggtaat aattgtcatt agattgtttt 
tttagacaag tatcaaacgg atgttaattc 
taagttgtct aagcgtcaat ttgtttacac 
agctccccga ccggcagctc ggcacaaaat 
tgagtaaaac agcttgc'gtc atgcggtcgc 

45 aatacgcaag gggaacgcat gaaggttatc 
aagacgacca tcgcaaccca tctagcccgc 
ttagtcgatt ccgatcccca gggcagtgcc 
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tgcaaccacc 


tgttgatccg 


catcacgcag 


4260 


ctgccagtca 


acagacgcgt 


ggttacagtc 


4320 


gtccacccag 


gtgttcggcg 


tggtgtagag 


4380 


aaagaaatca 


tggaagtaag 


actgcttttt 


4440 


cggcgggata 


gtctgccagt 


tcagttcgtt 


4500 


acaaattttg 


gtcatatatt 


agaaaagtta 


4560 


agaaaagcaa 


ttgctatata 


ctacattctt 


4620 


attactacta 


attaatgata 


attattatat 


4680 


tacacttttc 


ccggcaataa 


catacggcgt 


4740 


ctgatgctcc 


atcacttcct 


gattattgac 


4800 


gaaacgcagc 


acgatacgct 


ggcctgccca 


4860 


ccagacgttg 


cccgcataat 


tacgaatatc 


4920 


tggcacagca 


attgcccggc 


tttcttgtaa 


4980 


acagttttcg 


cgatccagac 


tgaatgccca 


5040 


tggggtttct 


acaggacgta 


ccatggtcga 


5100 


ccgggctcga 


ctctagatga 


aatcgaaatt 


5160 


cggacttatg 


aggatttcga 


gtatttcaag 


5220 


tgatctcgat 


ttggttaatc 


cagtatctcg 


5280 


tcaatttcat 


ctaacggtag 


agaatgatgt 


5340 


agattggatg 


agtgtagggt 


ccatcttatc 


5400 


tgaatattag 


ccaatcgaag 


ttccatatca 


5460 


ggaagataat 


aatgattgaa 


attttggtag 


5520 


tgagtggtct 


tcaaattatt 


gaacgtgaca 


5580 


gaaattttac 


ttattttaaa 


ggtatctcta 


5640 


agcatttgac 


ttgcagtttc 


atgctcatag 


5700 


atcttattac 


ttttttcgta 


atacaatttt 


5760 


taaaattagg 


gttttctaca 


aaacaaaaaa 


5820 


tactcgtaag 


ttacaaaaaa 


atacaaatga 


5880 


taccatgtac 


tttcatgtac 


tcaattcacc 


5940 


cacccaatct 


ataaaattct 


gtccatttga 


6000 


ttaaaattaa 


tattacatgt- 


tcactatttc 


6060 


ataaaaatgt 


tgacccatca 


ctacaaatat 


6120 


tggggataat 


tgatgaatta 


agtaaacatg 


6180 


ttgtatgtaa 


acctatgtga 


ttgatgttat 


6240 


gtaagcaaag 


atgattgttc 


ttatgaggta 


6300 


agaaattgtg 


ttagcgtacg 


caaaacaata 


6360 


aggtgact.tt 


tgttaatgat 


attgtagtat 


6420 


gagtataagc 


tatcaggatc 


cgggggatcc 


6480 


ctggccgtcg 


ttttacaacg 


actcagtact 


6540 


tatgcataga 


tgcactcgaa 


atcagccaat 


6600 


agtacattaa 


agacgtccgc 


aatgtgttat 


6660 


cacaatatat 


cctgccacca 


gccagccaac 


6720 


caccacgcgt 


ctaaaaaggt 


gatgtgtatt 


6780 


tgcgtatatg 


atgcgatgag 


taaataaaca 


6840 


gctgtactta 


accagaaagg 


cgggtcaggc 


6900 


gccctgcaac 


tcgccggggc 


cgatgttctg 


6960 


cgcgattggg 


cggccgtgcg 


ggaagatcaa 


7020 
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ccgctaaccg ttgtcggcat cgaccgcccg 
cggcgcgact tcgtagtgat cgacggagcg 
atcaaggcag ccgacttcgt gctgattccg 
accgccgacc tggtggagct ggttaagcag 
5 gcggcctttg tcgtgtcgcg ggcgatcaaa 
gcgctggccg ggtacgagct gcccattctt 
ccaggcactg ccgccgccgg cacaaccgtt 
cgcgaggtcc aggcgctggc cgctgaaatt 
aagagaaaat gagcaaaagc acaaacacgc 

10 gcaaggctgc aacgttggcc agcctggcag 
agttgccggc ggaggatcac accaagctga 
ttaccgagct gctatctgaa tacatcgcgc 
atgagtagat gaattttagc ggctaaagga 
accgacgccg tggaatgccc catgtgtgga 

15 tgggttgtct gccggccctg caatggcact 
gcggtcgcaa accatccggc ccggtacaaa 
agaagttgaa ggccgcgcag gccgcccagc 
tgaatcgtgg caaggggccg ctgatcgaat 
cggtgcgccg tcgattagga agccgcccaa 

20 gatgctctat gacgtgggca cccgcgatag 
tctgtcgaag cgtgaccgac gagctggcga 
cgtagaggtt tccgcagggc cggccggcat 
gatggcggtt tcccatctaa ccgaatccat 
gcccggccgc gtgttccgtc cacacgttgc 

25 tggcggaaag cagaaagacg acctggtaga 
tgccatgcag cgtacgaaga aggccaagaa 
agccttgatt agccgctaca agatcgtaaa 
gatcgagcta gctgattgga tgtaccgcga 
gacggttcac cccgattact ttttgatcga 

30 ggcacgccgc gccgcaggca aggcagaagc 
cagtggcagc gccggagagt tcaagaagtt 
aaatgacctg ccggagtacg atttgaagga 
catgcgctac cgcaacctga tcgagggcga 
gatgctaggg caaattgccc tagcagggga 

35 tagcacgtac attgggaacc caaagccgta 
cccaaagccg tacattggga accggtcaca 
aggcgatttt tccgcctaaa actctttaaa 
ctgtgcataa ctgtctggcc agcgcacagc 
gtcgctgcgc tccctacgcc ccgccgcttc 

40 aaataccgca cagatgcgta aggagaaaat 
tcactgactc gctgcgctcg gtcgttcggc 
cggtaatacg gttatccaca gaatcagggg 
gccagcaaaa ggccaggaac cgtaaaaagg 
gcccccctga cgagcatcac aaaaatcgac 

45 gactataaag ataccaggcg tttccccctg 
ccctgccgct taccggatac ctgtccgcct 
atagctcacg ctgtaggtat ctcagttcgg 



PCI7EP2005/002735 



acgattgacc 


gcgacgtgaa 


ggccatcggc 


7080 


ccccaggcgg 


cggacttggc 


tgtgtccgcg 


7140 


gtgcagccaa 


gcccttacga 


catatgggcc 


7200 


cgcattgagg 


tcacggatgg 


aaggctacaa 


7260 


ggcacgcgca 


tcggcggtga 


ggttgccgag 


7320 


gagtcccgta 


tcacgcagcg 


cgtgagctac 


7380 


cttgaatcag 


aacccgaggg 


cgacgctgcc 


7440 


aaatcaaaac 


tcatttgagt 


taatgaggta 


7500 


taagtgccgg 


ccgtccgagc 


gcacgcagca 


7560 


acacgccagc 


catgaagcgg 


gtcaactttc 


7620 


agatgtacgc 


ggtacgccaa 


ggcaagacca 


7680 


agctaccaga 


gtaaatgagc 


aaatgaataa 


7740 


ggcggcatgg 


aaaatcaaga 


acaaccaggc 


7800 


ggaacgggcg 


gttggccagg 


cgtaagcggc 


7860 


ggaaccccca 


agcccgagga 


atcggcgtga 


7920 


tcggcgcggc 


gctgggtgat 


gacctggtgg 


7980 


ggcaacgcat 


cgaggcagaa 


gacgccccgg 


8040 


ccgcaaagaa 


tcccggcaac 


cgccggcagc 


8100 


gggcgacgag 


caaccagatt 


ttttcgttcc 


8160 


tcgcagcatc 


atggacgtgg 


ccgttttccg 


8220 


ggtgatccgc 


tacgagcttc 


cagacgggca 


8280 


ggccagtgtg 


tgggattacg 


acctggtact 


8340 


gaaccgatac 


cgggaaggga 


agggagacaa 


8400 


ggacgtactc 


aagttctgcc 


ggcgagccga . 


8460 


aacctgcatt 


cggttaaaca 


ccacgcacgt 


8520 


cggccgcctg 


gtgacggtat 


ccgagggtga ■ 


8580 


gagcgaaacc 


gggcggccgg 


agtacatcga • 


8640 


gatcacagaa 


ggcaagaacc 


cggacgtgct . 


8700 


tcccggcatc 


ggccgttttc 


tctaccgcct 


8760 


cagatggttg 


ttcaagacga 


tctacgaacg 


8820 


ctgtttcacc 


gtgcgcaagc 


tgatcgggtc 


8880 


ggaggcgggg 


caggctggcc 


cgatcctagt 


8940 


agcatccgcc 


ggttcctaat 


gtacggagca 


9000 


aaaaggtcga 


aaaggtctct 


ttcctgtgga 


9060 


cattgggaac 


cggaacccgt 


acattgggaa 


9120 


catgtaagtg 


actgatataa 


aagagaaaaa 


9180 


acttattaaa 


actcttaaaa 


cccgcctggc 


9240 


cgaagagctg 


caaaaagcgc 


ctacccttcg 


9300 


gcgtcggcct 


atcgcggcct 


atgcggtgtg 


9360 


accgcatcag 


gcgctcttcc 


gcttcctcgc 


9420 


tgcggcgagc 


ggtatcagct 


cactcaaagg 


9480 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


9540 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


9600 


gctcaagtca 


gaggtggcga 


aacccgacag 


9660 


gaagctccct 


cgtgcgctct 


cctgttccga 


9720 


ttctcccttc 


gggaagcgtg 


gcgctttctc 


9780 


tgtaggtcgt 


tcgctccaag 


ctgggctgtg 


9840 
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tgcacgaacc ccccgttcag 
ccaacccggt aagacacgac 
gagcgaggta tgtaggcggt 
ctagaaggac agtatttggt 
5 ttggtagctc ttgatccggc 
agcagcagat tacgcgcaga 
ggtctgacgc tcagtggaac 
ctcccaattt gtgtagggct 
gatagtttgg ctgtgagcaa 

10 cgcgaagcgg cgtcggcttg 
tgatctcgcc tttcacgtag 
gatcttcttc ttgtccaaga 
ccggcaggcg ctccattgcc 
ctgcgctgta ccaaatgcgg 

15 cgggcggcga gttccatagc 
gaaccggatc aaagagttcc 
cttttgtcag caagatagcc 
gaatgtcatt gcgctgccat 
gaatgatgrtc gtcgtgcaca 

20 caggggaagc cgaagtttcc 
gccttacggt caccgtaacc 
ctgcggagcc gtacaaatgt 
caactacctc tgatagttga 
actttgtttt agggcgactg 

25 atcgacccac ggcgtaacgc 
aaaaacagtc ataacaagcc 



34/34 

cccgaccgct gcgccttatc 
ttatcgccac tggcagcagc 
gctacagagt tcttgaagtg 
atctgcgctc tgctgaagcc 
aaacaaacca ccgctggtag 
aaaaaaggat ctcaagaaga 
gaaaactcac gttaagggat 
tattatgcac gcttaaaaat 
ttatgtgctt agtgcatcta 
aacgaatttc tagctagaca 
tggacaaatt cttccaactg 
taagcctgtc tagcttcaag 
cagtcggcag cgacatcctt 
gacaacgtaa gcactacatt 
gttaaggttt catttagcgc 
tccgccgctg gacctaccaa 
agatcaatgt cgatcgtggc 
tctccaaatt gcagttcgcg 
acaatggtga cttctacagc 
aaaaggtcgt tgatcaaagc 
agcaaatcaa tatcactgtg 
acggccagca acgtcggttc 
gtcgatactt cggcgatcac 
ccctgctgcg taacatcgtt 
gcttgctgct tggatgcccg 
atgaaaaccg ccactgcgtt 



PCT/EP2005/002735 

cggtaactat cgtcttgagt 9900 
cactggtaac aggattagca 9960 

gtggcctaac tacggctaca 10020 

agttaccttc ggaaaaagag 10080 

cggtggtttt tttgtttgca 10140 

tcctttgatc ttttctacgg 10200 

tttggtcatg catgatatat 10260 

aataaaagca gacttgacct 10320 

acgcttgagt taagccgcgc 10380 

ttatttgccg actaccttgg 10440 

atctgcgcgc gaggccaagc 10500 

tatgacgggc tgatactggg 10560 

cggcgcgatt ttgccggtta 10620 

tcgctcatcg ccagcccagt 10 680 

ctcaaataga tcctgttcag 10740 

ggcaacgcta tgttctcttg 108 00 

tggctcgaag atacctgcaa 108 60 

cttagctgga taacgccacg 10920 

gcggagaatc tcgctctctc 10980 

tcgccgcgtt gtttcatcaa 11040 

tggcttcagg ccgccatcca 11100 

gagatggcgc tcgatgacgc 11160 

cgcttccccc atgatgttta 11220 

gctgctccat aacatcaaac 11280 

aggcatagac tgtaccccaa 1134 0 

ccatg 11385 



